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□ 1: K01760. HSVl (KOS) glycop...[gi:330082] 



Related Sequences, Protein, PubMed, Taxonomy 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TIT LEA 

JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 



TITLE 



y 



JOURNAL 
MEDLINE 

REFERENCE 
AUTHORS 
JOURNAL 

COMMENT 



HSIGB 3758 bp DNA linear VRL 02-AUG-1993 

HSVl (KOS) glycoprotein B gene, complete cds . 
K01760 ^ 
K01760.1 GI:330082 
glycoprotein; glycoprotein B. 

Herpes simplex virus type 1 (strain KOS) DNA, maintained on HEL 
cells . 

human herpesvirus 1 

Viruses; dsDNA viruses, no RNA stage; Herpesviridae; , 
Alphaherpesvirinae; Simplexvirus . 

1 (bases 1 to 3758) 

Bzik:,D.J., Fox,B.A., DeLuca,N.A. and Person, S. 
Nucleotide sequence specifying the glycoprotein gene, gB, 
simplex virus typg^j^^,^^^^^^^^^^^^^^,^^ 
(v^oTogy"r3'3r"3 0 1 - 3 1 4 °lT9frr> 

2 (bases 1 to 3321) 

Bzik,D.J., Debroy,C., Fox,B.A., Pederson, N . E . and Person, S. 
The nucleotide sequence of the gB glycoprotein gene of HSV-2 
comparisQn-'wi^thM5rhe--GO.-rx,esponding gene of HSV-1 
TTrSTogT 155, 322-333 (198^)^ 
8^ 



FEATURES 

source 



CDS 



of Herpes 



and 



3 (bases 1 to 3311) 
Pederson, N . 
Unpublished (1987) 
[2] revises [1] . 
[3] revises [1] , [2] . 

Draft entry and computer-readable sequence for [3] kindly provided 
by N. Pederson 14-APR-1987. 

There are 4 antigenically distinct glycoproteins in HSVl infected 
cells — gB, gC, gD and gE. The glycoprotein formerly known as gA 
is now know to be a precursor to gB and has been renamed pgB. gB is 
the only glycoprotein that is known to be required for viral 
growth. The gB mRNA probably runs about from bases 257 to 3300. 
Strand shown runs in direction opposite to that of the standard HSV 
map, from mu ,370 to .345. 

Location/ Qualifiers 

1. .3758 

/organism="human herpesvirus 1" 
/db_xref="taxon: 10298" 
548. .3262 

/note="glycoprotein B precursor (pgB) " 
/ codon_start=l 
/protein id=" AAA4 5774 . 1 " 
/db_xref="GI : 330083" 

/translation="MHQGAPSWGRRWFVVWALLGLTLGVLVASAAPTSPGTPGVAAAT 
QAANGGPATPAPPPLGAAPTGDPKPKKNKKPKNPTPPRPAGDNATVAAGHATLREHLR 
DIKAENTDANFYVCPPPTGATWQFEQPRRCPTRPEGQNYTEGIAVVFKENIAPYKFK 
ATMYYKDVTVSQVWFGHRYSQFMGIFEDRAPVPFEEVIDKINAKGVCRSTAKYVRNNL 
ETTAFHRDDHETDMELKPANAATRTSRGWHTTDLKYNPSRVEAFHRYGTTVNCIVEEV 
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DARSVYPYDEFVLATGDFVYMSPFYGYREGSHTEHTTYAADRFKQVDGFYARDLTTKA 
RATAPTTRNLLTTPKFTVAWDWVPKRPSVCTMTKWQEVDEMLRSEYGGSFRFSSDAIS 
TTFTTNLTEYPLSRVDLGDCIGKDARDAMDRIFARRYNATHIKVGQPQYYQANGGFLI 
AYQPLLSNTLAELYVREHLREQSRKPPNPTPPPPGASANASVERIKTTSSIEFARLQF 
TYNHIQRHVNDMLGRVAIAWCELQNHELTLWNEARKLNPNAIASVTVGRRVSARMLGD 
VMAVSTCVPVAADNVIVQNSMRISSRPGACYSRPLVSFRYEDQGPLVEGQLGENNELR 
LTRDAIEPCTVGHRRYFTFGGGYVYFEEYAYSHQLSRADITTVSTFIDLNITMLEDHE 
FVPLEVYTRHEIKDSGLLDYTEVQRRNQLHDLRFADIDTVIHADANAAMFAGLGAFFE 
GMGDLGRAVGKWMGIVGGWSAVSGVSSFMSNPFGALAVGLLVLAGLAAAFFAFRYV 
MRLQSNPMKALYPLTTKELKNPTNPDASGEGEEGGDFDEAKLAEAREMIRYMALVSAM 
ERTEHKAKKKGTSALLSAKVTDMVMRKRRNTNYTQVPNKDGDADEDDL" 

variation 1364 

/note="g in KOS; a in mutant tsJ20" 

variation 167 6 

/note="c in KOS; t in mutant tsJ12" 
BASE COUNT 717 a 1286 c 1128 g 627 t 

ORIGIN 295 bp upstream of Xhol site; left half of BamHI-G frag at .370 mu. 

1 gtcaacgggc ccctctttga tcactccacc cacagcttcg cccagccccc caacaccgcg 
61 ctgtattaca gcgtcgagaa cgtggggctc ctgccgcacc tgaaggagga gctcgcccgg 
121 ttcatcatgg gggcgggggg ctcgggtgct gattgggccg tcagcgaatt tcagaggttt 
181 tactgttttg acggcatttc cggaataacg cccactcagc gcgccgcctg gcgatatatt 
241 cgcgagctga ttatcgccac cacactcttt gcctcggtct accggtgcgg ggagctcgag 
301 ttgcgccgcc cggactgcag ccgcccgacc tccgaaggtc gttaccgtta cccgcccggc 
361 gtatatctca cgtacgactc cgactgtccg ctggtggcca tcgtcgagag cgcccccgac 
421 ggctgtatcg gcccccggtc ggtcgtggtc tacgaccgcg acgttttctc gatcctctac 
481 tcggtcctcc agcacctcgc ccccaggcta cctgacgggg ggcacgacgg gcccccgtag 
541 tcccgccatg caccagggcg ccccctcgtg ggggcgccgg tggttcgtcg tatgggcgct 
601 cttggggttg acgctggggg tcctggtggc gtcggcggct ccgacttccc ccggcacgcc 
661 tggggtcgcg gccgcgaccc aggcggcgaa cgggggccct gccactccgg cgccgccgcc 
721 ccttggcgcc gccccaacgg gggacccgaa accgaagaag aacaaaaaac cgaaaaaccc 
781 aacgccacca cgccccgccg gcgacaacgc gaccgtcgcc gcgggccacg ccaccctgcg 
841 cgagcacctg cgggacatca aggcggagaa caccgatgca aacttttacg tgtgcccacc 
901 ccccacgggc gccacggtgg tgcagttcga gcagccgcgc cgctgcccga cccggcccga 
961 gggtcagaac tacacggagg gcatcgcggt ggtcttcaag gagaacatcg ccccgtacaa 
1021 gttcaaggcc accatgtact acaaagacgt caccgtttcg caggtgtggt tcggccaccg 
1081 ctactcccag tttatgggga tctttgagga ccgcgccccc gtccccttcg aggaggtgat 
1141 cgacaagatc aacgccaagg gggtctgtcg gtccacggcc aagtacgtgc gcaacaacct 
1201 ggagaccacc gcgtttcacc gggacgacca cgagaccgac atggagctga aaccggccaa 
1261 cgccgcgacc cgcacgagcc ggggctggca caccaccgac ctcaagtaca acccctcgcg 
1321 ggtggaggcg ttccaccggt acgggacgac ggtaaactgc atcgtcgagg aggtggacgc 
1381 gcgctcggtg tacccgtacg acgagtttgt gctggcgact ggcgactttg tgtacatgtc 
1441 cccgttttac ggctaccggg aggggtcgca caccgaacac accacgtacg ccgccgaccg 
1501 cttcaagcag gtcgacggct tctacgcgcg cgacctcacc accaaggccc gggccacggc 
1561 gccgaccacc cggaacctgc tcacgacccc caagttcacc gtggcctggg actgggtgcc 
1621 aaagcgcccg tcggtctgca ccatgaccaa gtggcaggaa gtggacgaga tgctgcgctc 
1681 cgagtacggc ggctccttcc gattctcctc cgacgccata tccaccacct tcaccaccaa 
1741 cctgaccgag tacccgctct cgcgcgtgga cctgggggac tgcatcggca aggacgcccg 
1801 cgacgccatg gaccgcatct tcgcccgcag gtacaacgcg acgcacatca aggtgggcca 
1861 gccgcagtac taccaggcca atgggggctt tctgatcgcg taccagcccc ttctcagcaa 
1921 cacgctcgcg gagctgtacg tgcgggaaca cctccgagag cagagccgca agcccccaaa 
1981 ccccacgccc ccgccgcccg gggccagcgc caacgcgtcc gtggagcgca tcaagaccac 
2041 ctcctccatc gagttcgccc ggctgcagtt tacgtacaac cacatacagc gccatgtcaa 
2101 cgatatgttg ggccgcgttg ccatcgcgtg gtgcgagcta cagaatcacg agctgaccct 
2161 gtggaacgag gcccgcaagc tgaaccccaa cgccatcgcc tcggtcaccg tgggccggcg 
2221 ggtgagcgcg cggatgctcg gcgacgtgat ggccgtctcc acgtgcgtgc cggtcgccgc 
2281 ggacaacgtg atcgtccaaa actcgatgcg catcagctcg cggcccgggg cctgctacag 
2341 ccgccccctg gtcagctttc ggtacgaaga ccagggcccg ttggtcgagg ggcagctggg 
2401 ggagaacaac gagctgcggc tgacgcgcga tgcgatcgag ccgtgcaccg tgggacaccg 
24 61 gcgctacttc accttcggtg ggggctacgt gtacttcgag gagtacgcgt actcccacca 
2521 gctgagccgc gccgacatca ccaccgtcag caccttcatc gacctcaaca tcaccatgct 
2581 ggaggatcac gagtttgtcc ccctggaggt gtacacccgc cacgagatca aggacagcgg 
2641 cctgctggac tacacggagg tccagcgccg caaccagctg cacgacctgc gcttcgccga 
2701 catcgacacg gtcatccacg ccgacgccaa cgccgccatg ttcgcgggcc tgggcgcgtt 
27 61 cttcgagggg atgggcgacc tggggcgcgc ggtcggcaag gtggtgatgg gcatcgtggg 
2821 cggcgtggta tcggccgtgt cgggcgtgtc ctccttcatg tccaacccct ttggggcgct 
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2881 
2941 
3001 
3061 
3121 
3181 
3241 
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3361 
3421 
3481 
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3601 
3661 
3721 



// 



ggccgtgggt 
catgcggctg 
gaaccccacc 
caagctagcc 
cacggaacac 
catggtcatg 
cgccgacgag 
ccgcatgcgc 
agaaaaagga 
gtgacaaaac 
tcaaaggctt 
aacatacacc 
gacatcccat 
acaaatcggc 
ggccccggcc 



ctgttggtcc 
cagagcaacc 
aacccggacg 
gaggccaggg 
aaggccaaga 
cgcaagcgcc 
gacgacctgt 
atcttttggt 
acacataaac 
ggaccccctt 
ttgcattgta 
aatgggtctt 
ggtaccaaag 
ggcgcgggcc 
tccgcgttgg 



tggccggcct 
ccatgaaggc 
cgtccgggga 
agatgatacg 
agaagggcac 
gcaacaccaa 
gacggggggt 
ttttttgttt 
tcccccgggt 
ggtcagtgcc 
aagctacccg 
atttggtatt 
accggggcga 
gtgtcaacgt 
atgctccggt 



ggcggcggcc 
cctgtaccct 
gggcgaggag 
gtacatggcc 
gagcgcgctg 
ctacacccaa 
ttgttgtaaa 
ggtcagcctt 
gtccgcggcc 
gatttcctcc 
cctacccgcc 
acctggttta 
atcagcgggc 
ccacgtgtgc 
tgggatcc 



ttcttcgcct 
ctaaccacca 
ggcggcgact 
ctggtgtcgg 
ctcagcgcca 
gttcccaaca 
taaaaaccac 
ttgtgtgtgt 
tgtttcctct 
cccccacgcc 
cctcccaata 
tttaaaaaga 
ccccatcatc 
tgcgctgctg 



ttcgttacgt 
aggagctcaa 
ttgacgaggc 
ccatggagcg 
aggtcaccga 
aagacggtga 
gggtgttaaa 
gtgggaagaa 
ttcctttccc 
ttcctccacg 
aaaaaaaaag 
tatacagtaa 
tgagagacga 
gcgttgacaa 
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Nucleotide sequence specifying the glycoprotein gene, gB, of herpes 
simplex virus type 1. 

Bzik DJ, Fox BA, DeLuca NA, Person S. 

The nucleotide sequence thought to specify the glycoprotein gene, gB, of the KOS 
strain of herpes simplex virus type 1 (HSV-1) has been determined. A 3.1-kilobase 
(kb), viral-specified RNA was mapped to the left half of the BamHI-G fragment 
(0.345 to 0.399 map units). TATA, CAT-box, and possible mRNA start sequences 
characteristic of HSV-1 genes are found near 0.368 map units. The first available 
ATG codon is at 0.366 and the first in-phase chain terminator at 0.348 map units, j 
polyA-addition signal (AATAAA) occurs 17 nucleotides past the chain terminator. 
Translation of these sequences w^ould yield a 100.3-kilodalton (kDa) polypeptide 
characterized by a 5' signal sequence, nine N-linked saccharide addition sites, a 
strongly hydrophobic membrane-spanning sequence, and a highly charged 3' 
cytoplasmic anchor sequence. Tv^o mutants of KOS, tsJ12 and tsJ20, that are 
temperature-sensitive for viral growth and for the production of gB, have been 
physically mapped to 0.357 to 0.360 and 0.360 to 0.364 map units, respectively 
(DeLuca et al., in preparation). The nucleotide sequence of the mutants was 
determined in these regions. In both cases a single amino acid replacement within 
the 100.3-kDa polypeptide is predicted from the sequence analysis. 

MeSH Terms: 

o Amino Acid Sequence 

o Base Sequence 

o Cell Line 

o DNA Restriction Enzymes 

o DNA, Recombinant/analysis 

o Genes, Structural* 

o Genes, Viral* 

o Human 

o Lung/embryology 

o Nucleic Acid Hybridization 

o Simplexvirus/genetics* 

o Support, Non-U.S. Gov't 

o Support, U.S. Gov't, P.H.S. 
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Substances: 
o DNA Restriction Enzymes 
o DNA, Recombinant 

Secondary source id: 
o GENBANK/K01760 

PMID: 6324454 [PubMed - indexed for MEDLINE] 
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□ 1: M21629. Herpes simplex vi...[gi:330109] 



Related Sequences, Protein, PubMed, Taxonomy 



He rpe s vi r i da ej 



Pauli,G,, Ludwig,H. and 



LOCUS HSIGLYB 9756 bp DNA linear VRL Ol-JUL-1994 

DEFINITION Herpes simplex virus type 1 glycoprotein B gene (gB-1), complete 
cds . 

ACCESSION M21629 

VERSION M21629.1 GI:330109 

KEYWORDS glycoprotein B. 

SOURCE Herpes simplex virus type 1 DNA. 

ORGANISM human herpesvirus 1 

_^ Viruses ;^,_ds DNA yiruses, . no RNA stage; 
Alphaherpesvirinae; Simplexvirus . 
REFERENCE 1 (bases 1 to 9756) 

AUTHORS Hammerschmidt, W. , Conraths,F., Mankert2,J 
Buhk,H.J. 

TITLE Conservation of a gene cluster including glycoprotein B in bovine 

herpesvirus type 2 (BHV-2) and herpes simplex virus type 1 (HSV-1) 
JOURNAL Virology 165 (2), 388-405 (1988) 
MEDLINE 88306231 
FEATURES Location/Qualifiers 
source 1 . . 9756 

/organism="human herpesvirus 1" 
/db_xref="taxon: 10298" 
gene 6555. . 9266 

/gene="gB-l" 
CDS 6555.. 9266 

/gene="gB-l" 
/codon_start=l 
/product="glycoprotein B" 
/protein id=" AAA194 96 . 1 " 
/db_xref="GI: 330110" 
* /translation="MRQG7\ARGCRWFVVWALLGLTLGVLVASAAPSSPGT?GVAA;(TQ" 
AANGGPATPAPPAPGPAPTGDTKPKKNKKPKNPPPPRPAGDNATVAAGHATLREHLRD 
IKAENTDANFYVCPPPTGATVVQFEQPRRCPTRPEGQNYTEGIAVVFKENIAPYKFKA 
TMYYKDVTVSQVWFGHRYSQFMGIFEDRAPVPFEEVIDKINAKGVCRSTAKYVRNNLE 
TTAFHRDDHETDMELKPANAATRTSRGWHTTDLKYNPSRVEAFHRYGTTVNCIVEEVD 
ARSVYPYDEFVLATGDFVYMSPFYGYREGSHTEHTSYAADRFKQVDGFYARDLTTKAR 
ATAPTTRNLLTTPKFTVAWDWVPKRPSVCTMTKWQEVDEMLRSEYGGSFRFSSDAIST 
TFTTNLTEYPLSRVDLGDCIGKDARDAMDRIFARRYNATHIKVGQPQYYLANGGFLIA 
YQPLLSNTLAELYVREHLREQSRKPPNPTPPPPGASANASVERIKTTSSIEFARLQFT 
YNHIQRHVNDMLGRVAIAWCELQNHELTLWNEARKLNPNAIASATVGRRVSARMLGDV 
MAVSTCVPVAADNVIVQNSMRISSRPGACYSRPLVSFRYEDQGPLVEGQVGENNELRL 
TRDAIEPCTVGHRRYFTFGGGYVYFEEYAYSHQLSRADITTVSTFIDLNITMLEDHEF 
VPLEVYTRHEIKDSGLLDYTEVQRRNQLHDLRFADIDTVIHADANAAMFAGLGAFFEG 
MGDLGRAVGKVVMGIVGGVVSAVSGVSSFMSNPFGALAVGLLVLAGLAAAFFAFRYVM 
RLQSNPMKALYPLTTKELKNPTNPDASGEGEEGGDFDEAKLAEAREMIRYMALVSAME 
RTEHKAKKKGTSALLSAKVTDMVMRKRRNTNYTQVPNKDGDADEDDL" 
3321 c 3202 g 1571 t 



1662 a 



BASE COUNT 
ORIGIN 

1 agtgtacata taccaaccgc ataccagacg cacccgaccc ggagcacctg accgtaagca 
61 tctgtgcctc tcgcagggac cccgcgttgc cggccgccgg ggttcatcgg cgcccccgtg 
121 gttacccggg gggttgtcgg tgaaggggag ggattcattc cccaaccccg gtctccaacc 
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181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 
3361 
3421 
3481 
3541 
3601 
3661 
3721 
3781 
3841 
3901 
3961 
4021 



ctccccttga 
ggcgtccgtc 
agacaaagcc 
acgctcgcgc 
gcgattccga 
aggccaacgt 
ccctgaaact 
acctggaccc 
attttggctt 
aggcgctgtg 
ccgagggctt 
acaaggtaac 
tcatgccgga 
gggagaaatt 
aggcggtcgt 
gcgcggccgc 
ttacgttcac 
gcggcaaggg 
ccgccctggc 
tctccgcgtg 
gggcgtacct 
ccctccatct 
cctcctttta 
accgcgaggg 
gaacccagga 
tggccaagat 
agatggacgt 
gcaccttcga 
atccaaagtt 
tgtacagcga 
acggatccga 
tcatggccga 
ggctggagac 
aggtcgtgga 
agtttcgcga 
cgtgcgggcc 
aggacctggc 
actttcgcaa 
ggtttctgtc 
ccagcctaac 
gcgtgaccct 
cgagcgccaa 
agaagcccga 
tccacgcggc 
ggttctggac 
tcgagaccat 
acctggcccc 
ggtactgcga 
gccgtccccc 
aggccggggc 
tgttcgccag 
ggttgagcat 
actgggccag 
gcaagttcgt 
gcggagcgca 
cggccgtcgc 
agctgcagat 
tggagctgac 
tggaggtggc 
ttaactttgg 
ccccgggacc 
aggggccccc 
gggggaacct 
gcttgtttgt 
gggaatagaa 



ccgtcgccgc 
gtcgtgccgg 
caagacggca 
gtgtccgtcc 
cgtcgccgtg 
ggccgtggtc 
gacgccctcg 
cagcacccag 
ttcggactac 
tgagcgcctc 
caaggaggcc 
cataggcggg 
ttttagccgt 
tacctacccg 
gggacccgcc 
ccacctggcg 
ggccttcgaa 
cgcggcgggc 
cctcgagtct 
gccgctgttc 
ggcgcgcgcc 
caccgaggtg 
ccgcttcttc 
acacgtggtg 
atttgccggc 
gctgttttac 
gtttcgatac 
cacgcgccac 
cgccagcgcc 
ctgcgacgtg 
gaccgcccgg 
actcgagacc 
catcatcacc 
ccgcgaggtg 
cggtctgggc 
gtgccccctg 
cctgagtcag 
tcaattccaa 
ggccaaaacg 
ggccggccag 
ggggtttccc 
cgcgtccgag 
caagcgcgtg 
catcttcccc 
ggccctccaa 
cgcgttcatt 
caacaacgtg 
tcactcgaca 
cagcgtgcag 
ccgcgcgctg 
ctgcaacctg 
cagcaagtac 
cctgatgggc 
cctggcctgt 
cgaaagctcg 
cagtagcgtg 
cgaggactgg 
cgcacgtgcc 
gcacgaggcc 
ggattttggc 
ggcatttgcc 
cgacaaaaag 
agggcggggc 
gttccgtgtc 
cagaaacgca 



ccccccccgg 
gattacctcc 
accaccatca 
gaaggcatcg 
gcgcccctgg 
gtgggttctc 
cactacagct 
gccccgaacc 
accccccggc 
ggcctggacc 
gtgtcgatca 
gcggaggtgc 
gtcatcgcag 
cttccgtttt 
gccgtggcac 
tttgacgaaa 
gccagccagg 
gggttcgaac 
atcgtgtcga 
gagggccagg 
gcgggactcg 
gacgacgccg 
ctcgtgcccg 
cccgggttcg 
gagcacctgg 
ctggagcgct 
gtcgcggact 
gcctgcgtac 
gcccgcggag 
ctgggaaact 
accatcatgc 
ctgcagtacg 
aaccgcgagg 
gagcagctga 
gaggccaacc 
cttcagcttc 
tgccacgggg 
ccggtgctgc 
ctgacggtcg 
acggcccccg 
aaggagctgc 
gccgccaagg 
gacatcctcc 
aacggcaagc 
cgcaaccagc 
aaaaagtttt 
agcgagctgg 
tacttcatca 
gctgcggccg 
gtggacgccg 
ctgcggcccg 
tacggcatgg 
ggcaaaaacg 
ccccgggccg 
ctgtgcgagc 
ttcgtggcga 
ctggcgctcc 
ctggagcgcg 
gaggccctag 
tgcgaggacg 
ggccgcaaac 
ggagacctga 
ggggaatgtg 
tatatctctg 
cgcgccgccg 



attttgacgc 
gttcgcggac 
aggtcccccc 
agcttctggc 
tcgtgggcct 
gcacgacggg 
cgtccgtgta 
tgacgcgcct 
ccggcgacct 
cggaccgcgc 
acaacacctt 
accgcatacc 
agccgttcaa 
ttaaccgccc 
tgcgatgccg 
accacgaggg 
gtaagacccc 
agcgcctggc 
tggccgtctt 
acacggccgc 
tgggggccat 
gcccggcgga 
ggacccacgt 
agggtcggcc 
ccatgctgtg 
gcgacggcgc 
ccaaccagac 
acacgacgct 
ccatcggcgt 
acgccgcctt 
aggagacgta 
tggaccaggc 
ccctgcatac 
tgcgcaacct 
acgccatgtc 
tcgggcggcg 
tgttcgccgg 
ggcggcgcgt 
cgctctcgga 
ccgagagcag 
gcgtcaagag 
cgcgggtcgc 
tcggaccgct 
ccccggggtc 
ttcccgcccg 
ccctggacta 
cgatgtacta 
acacccttac 
cgtggtccgc 
tggacgcgca 
tcatggcggc 
ccggcaacga 
cgtgcccgct 
ggtttgtgtg 
agctccgggg 
ccgtgaaaag 
tggaggacga 
gcaacggcga 
tcagccaact 
acaacgcgac 
gggcgttcca 
cgttggatat 
tgtaaaataa 
ggcgggccgt 
actcccggct 



tcgggagaca 
cgattgacaa 
cgggcccctg 
gttactgtcg 
gaccgtggag 
gctcgggggt 
cgtctttcac 
ttgcgagcgg 
caaacacgag 
cctcctgtat 
tctgcacctg 
cgtgtacccg 
cgccaaccac 
cctcaaccgc 
aaacgtggac 
cgccgccctc 
gcggggcgag 
ctccgtcatg 
tgacgagccg 
ggcccgcgcc 
ggtatttagc 
cccaaaggac 
ggcggccaac 
caccgcgccc 
tgggttttcc 
cgtgatcgtc 
cgacgtgccc 
catgcgcctc 
cttcgggacc 
ctcggccctg 
ccgcgcggcg 
ggtccccacg 
ggtggtgaac 
ggtggagggg 
cctgacgctg 
atccaacctc 
gcagtcggtc 
gatggacatg 

gggggcggct 
cttcgagggc 
ccgcgtgttg 
cagcctccag 
gggctttctg 
caaccagccg 
gctcctgtcg 
cggcgcgata 
catggcaaac 
ggccatcatc 
gcagggcggg 
tccgggcgcg 
gcgccccatg 
ccgtgtgttt 
ccttattttt 
cgcggcctcg 
cattatctcc 
cctggggccc 
gtacctaagc 
gtggtcgacg 
cggcaacgcc 
gccgttcggc 
cggggatgac 
gctgtgaggg 
attattgcta 
gattcctctc 
tgccggtcgg 



taccttgtcg 
aaggacatgg 
ggatacgtgt 
gcacgcagcg 
agcggctttg 
accgcggtgt 
ggcggccggc 
gcacgccgcc 
acgacggggg 
ctggtcgtta 
ggaggctcgg 
ttgcagctgt 
cgatcgatcg 
ctcctgttcg 
gccgtggccc 
cccgccgaca 
cgcgacggcg 
gccggagacg 
cccaccgaca 
aacgccgtcg 
accaactcgg 
cacagcaaac 
ccacaggtgg 
ctcgtcggcg 
ccggcgctgc 
gggcgccagg 
tgtaacctat 
cgggcgcgcc 
atgaacagca 
aagcgcgcgg 
accgagcgcg 
gccatggggc 
aacgtcaggc 
aggaacttca 
gacccgtacg 
gccgtgtacc 
gaggggcgca 
tttaacaacg 
atctgcgccc 
gacgttgccc 
ttcgcgggcg 
agcgcctacc 
ctgaagcagt 
aacccgcagt 
cgcgaggaca 
aactttatta 
cagattctgc 
gcggggtccc 
gcgggcctgg 
tggacgtcca 
gtcgtgttgg 
caggccggga 
gaccgcaccc 
agcctcggcg 
gagggcgggg 
cgcacccaac 
gaggagatga 
gacgcggccc 
ggggaggtgt 
ggcccggggg 
ccgtttgggg 
gttggggggt 
cgacatccgt 
cgcggtgtct 
cgggcccgcg 
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4081 
4141 
4201 
4261 
4321 
4381 
4441 
4501 
4561 
4621 
4681 
4741 
4801 
4861 
4921 
4981 
5041 
5101 
5161 
5221 
5281 
5341 
5401 
5461 
5521 
5581 
5641 
5701 
5761 
5821 
5881 
5941 
6001 
6061 
6121 
6181 
6241 
6301 
6361 
6421 
6481 
6541 
6601 
6661 
6721 
6781 
6841 
6901 
6961 
7021 
7081 
7141 
7201 
7261 
7321 
7381 
7441 
7501 
7561 
7621 
7681 
7741 
7801 
7861 
7921 



ggaggccgcc 
cgcctcggtc 
cggtgtccga 
cctatgtgtt 
tcccccaact 
gcctggaggc 
tagagtacgc 
cgggagacgt 
accaacgggt 
acgacgtcga 
gcgccaccgc 
ttatcgtccc 
tgtgtgtgac 
accacgtcac 
cccacgccgc 
cgctggcccg 
tccgcaagga 
gcctgtacgc 
ccaccatgga 
agaccgccgc 
gcgcgttcgg 
aggccacgtc 
tgacgacgcc 
gtcgcgtgct 
gccccgacga 
ccgggggcgg 
cgtgggggga 
cggaccgcct 
gggagctcga 
ccgcggtcgc 
gcacggtcat 
cgtctctcct 
tcgagctcgt 
acaccgcgct 
tcgcccggtt 
gaggttttac 
atatattcgc 
gctcgagttg 
gcccggcgta 
ccccgacggc 
cctctactcg 
cccgtagtcc 
cgctcttggg 
cgcctggggt 
ccgcccctgg 
acccaccgcc 
tgcgcgagca 
caccccccac 
ccgagggtca 
acaagttcaa 
accgctactc 
tgatcgacaa 
acctggagac 
ccaacgccgc 
cgcgggtgga 
acgcgcgctc 
tgtccccgtt 
accgcttcaa 
cggcgccgac 
tgccaaagcg 
gctccgagta 
ccaacctgac 
cccgcgacgc 
gccagccgca 
gcaacacgct 



ccgaataggg 
caccgcatac 
gcccaccgtg 
tcagatagag 
gaagctgaac 
ccaggccggg 
gtggcgcgag 
cgcggtgttt 
caggctggat 
gaactttcta 
cgcgctggag 
cccggagctt 
ggcgaaccag 
ccagcaggcg 
cgggctgtcg 
gaccgcgggg 
ggccgacgcc 
catcagcgaa 
cgcgtttgcc 
ggtggccgtg 
gagccacctg 
acccgacgat 
gctcttgcgg 
ggcgcgcatg 
cgacggggac 
gggcgaggac 
catcgccacg 
gacgaagcgg 
gaagatgctg 
caacggcttt 
cgacaaccgc 
gcgacaccag 
caacgggccc 
gtattacagc 
catcatgggg 
tgttttgacg 
gagctgatta 
cgccgcccgg 
tatctcacgt 
tgtatcggcc 
gtcctccagc 
cgccatgcgc 
gttgacgctg 
cgcggccgcg 
ccccgcccca 
gccgcgcccc 
cctgcgggac 
gggcgccacg 
gaactacacg 
ggccaccatg 
ccagtttatg 
gatcaacgcc 
caccgcgttt 
gacccgcacg 
ggcgttccac 
ggtgtacccg 
ttacggctac 
gcaggttgac 
cacccggaac 
cccgtcggtc 
cggcggctcc 
cgagtacccg 
catggaccgc 
gtactacctg 
cgcggagctg 



ggaccccggg 
tccggccgcg 
gcccgtcaaa 
ctgctccggc 
gcgcttcagg 
gcctttctca 
ggcgagcggc 
ttcacggaga 
acgtatggcg 
aagcagctaa 
cgcgttcggg 
agcgacccgt 
ggggcgacga 
caggtgcggc 
gacgccgacc 
ggcgacgggc 
ctgctggagg 
ttgcgattct 
agcaacctga 
gaactggcgc 
gcggcgctgg 
cagatcgagg 
cgcctcgtca 
ggggcggggg 
cgtgtcgccg 
gaagaccgtc 
caagcggccg 
tcgttggcca 
cgggtcagcg 
gcggcgcgcg 
tcggcgccgg 
gtggacccgg 
ctctttgatc 
gtcgagaacg 
gcgggggctc 
gcatttccgg 
tcgccaccac 
actgcagccg 
acgactccga 
cccggtcggt 
acctcgcccc 
cagggcgccg 
ggggtcctgg 
acccaggcgg 
acgggggaca 
gccggcgaca 
atcaaggcgg 
gtggtgcagt 
gagggcatcg 
tactacaaag 
gggatctttg 
aagggggtct 
caccgggacg 
agccggggct 
cggtacggga 
tacgacgagt 
cgggaggggt 
ggcttctacg 
ctgctcacga 
tgcaccatga 
ttccgattct 
ctctcgcgcg 
atcttcgccc 
gccaatgggg 
tacgtgcggg 



gctcagccag 
gtacagatcg 
agttgttagc 
ggtgcgaccc 
tgcgggcgct 
ccccgctgtc 
tcctgggcag 
ccatgggcct 
ggaccgtcca 
actactgcca 
agtttatggt 
cccacccctg 
tcgcccgccg 
tggacgccaa 
gcgcgcgggc 
agccccaccc 
cgcacgacgt 
ggctcgcgtc 
ccgcgctggc 
tgttcgggcg 
atatggtgga 
cgctcatccg 
gccccgaaca 
gcgcggcgga 
tagaggaagg 
gccgcgggcc 
cggacgtgcg 
gcctggggcg 
tccacggcga 
cgcgcttctg 
gcgtgttcga 
ccctgctccc 
actccaccca 
tggggctcct 
gggtgctgat 
aataacgccc 
actctttgcc 
cccgacctcc 
ctgtccgctg 
cgtggtctac 
caggctacct 
cgcgggggtg 
tggcgtcggc 
cgaacggggg 
cgaaaccgaa 
acgcgaccgt 
agaacaccga 
tcgagcagcc 
cggtggtctt 
acgtcaccgt 
aggaccgcgc 
gtcggtccac 
accacgagac 
ggcacaccac 
cgacggtaaa 
ttgtgctggc 
cgcacaccga 
cgcgcgacct 
cccccaagtt 
ccaagtggca 
cctccgacgc 
tggacctggg 
gcaggtacaa 
gctttctgat 
aacacctccg 



acgccgggta 
gcgccgcgag 
cctgctcgga 
ccacatcgga 
gcggcgtcgt 
ggtcaccctg 
cctggagacg 
ggcccgaccc 
tatggagctg 
cctcatcacc 
gggggcggtg 
cgcggtctgt 
cctggcggac 
cgagctgcgg 
gctctccgtg 
gtcgcccgag 
gtttcaggcc 
cggcgaccgc 
gcggcgcgag 
gcgggcggag 
cgccctaata 
cgcgtgctac 
gtgcgacgag 
cgcgcccaag 
ggcacggggg 
cggggggcag 
ggagcgacgg 
ctgcgtccgc 
ggtgctgccc 
cgccctgacg 
cgcgcaccgg 
cagcatcacc 
cagcttcgcc 
gccgcacctg 
tgggccgtca 
actcagcgcg 
tcggtctacc 
gaaggtcgtt 
gtggccatcg 
gaccgagacg 
gacggggggc 
ccggtggttc 
ggctccgagt 
acctgccact 
gaagaacaaa 
cgccgcgggc 
tgcaaacttt 
gcgccgctgc 
caaggagaac 
ttcgcaggtg 
ccccgtcccc 
ggccaagtac 
cgacatggag 
cgacctcaag 
ctgcatcgtc 
gactggcgac 
acacaccagc 
caccaccaag 
caccgtggcc 
ggaggtggac 
catatccacc 
ggactgcatc 
cgcgacgcac 
cgcgtaccag 
agagcagagc 



gcgtacggac 
atggccgccc 
caggtgcaga 
cgggggaagc 
ctgaggccgg 
gagttgctgc 
ttcgcgaccg 
tgtccgtatc 
tgtttcctgc 
ccctcgcgcg 
gggtcgggcc 
ttcgaggaac 
cgtatctgta 
cggtacctgc 
ttggaccatg 
aacgactcgg 
accacgcccg 
gccggccaga 
ttgcagcagg 
catttcgatc 
atcggcggtc 
gaccaccacc 
gaggcgctgc 
ggcggcgcgg 
ttgggagctc 
gggcccgaga 
cggctgtacg 
gagcagcgcg 
gcgacgttcg 
gcgggcgcgg 
ttcatgcgag 
catcgcttct 
cagcccccca 
aaggaggagc 
gcgaatttca 
ccgcctggcg 
ggtgcgggga 
accgttaccc 
tcgagagcgc 
ttttctcgat 
acgacgggcc 
gtcgtatggg 
tcccccggca 
ccggcgccgc 
aaaccgaaaa 
cacgccaccc 
tacgtgtgcc 
ccgacccggc 
atcgccccgt 
tggttcggcc 
ttcgaggagg 
gtgcgcaaca 
ctgaaaccgg 
tacaacccct 
gaggaggtgg 
tttgtgtaca 
tacgccgccg 
gcccgggcca 
tgggactggg 
gagatgctgc 
accttcacca 
ggcaaggacg 
atcaaggtgg 
ccccttctca 
cgcaagcccc 
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7981 
8041 
8101 
8161 
8221 
8281 
8341 
8401 
8461 
8521 
8581 
8641 
8701 
8761 
8821 
8881 
8941 
9001 
9061 
9121 
9181 
9241 
9301 
9361 
9421 
9481 
9541 
9601 
9661 
9721 



// 



caaaccccac 
ccacctcctc 
tcaacgatat 
ccctgtggaa 
ggcgggtgag 
ccgcggacaa 
acagccgccc 
tgggggagaa 
accggcgcta 
accagctgag 
tgctggagga 
gcggcctgct 
ccgacatcga 
cgttcttcga 
tgggcggcgt 
cgctggccgt 
acgtcatgcg 
tcaagaaccc 
aggccaagct 
agcgcacgga 
ccgacatggt 
gtgacgccga 
taaaccgcat 
aaagaaaagg 
ccgtgacaaa 
cgaaggcttt 
tacaccaatg 
tcccatggta 
atcggcggcg 
cccggcctcc 



gcccccgccg 
catcgagttc 
gttgggccgc 
cgaggcccgc 
cgcgcggatg 
cgtgatcgtc 
cctggtcagc 
caacgagctg 
cttcaccttc 
ccgcgccgac 
tcacgagttt 
ggactacacg 
cacggtcatc 
ggggatgggc 
ggtatcggcc 
gggtctgttg 
gctgcagagc 
caccaacccg 
agccgaggcc 
acacaaggcc 
catgcgcaag 
cgaggacgac 
gtgcatcttt 
gaacacataa 
acggaccccc 
tgcattgtaa 
ggtcttattt 
ccaaagaccg 
cgggccgtgt 
gcgttggatg 



cccggggcca 
gcccggctgc 
gttgccatcg 
aagctgaacc 
ctcggcgacg 
caaaactcga 
tttcggtacg 
cggctgacgc 
ggtgggggct 
atcaccaccg 
gtccccctgg 
gaggtccagc 
cacgccgacg 
gacctggggc 
gtgtcgggcg 
gtcctggccg 
aaccccatga 
gacgcgtccg 
cgggagatga 
aagaagaagg 
cgccgcaaca 
ctgtgacggg 
tggtttgttt 
actcccccgg 
ttggtcagtg 
agctacccgc 
ggtattacct 
gggcgaatca 
caacgtccac 
cctccggttg 



gcgccaacgc 
agtttacgta 
cgtggtgcga 
ccaacgccat 
tgatggccgt 
tgcgcatcag 
aagaccaggg 
gcgatgcgat 
acgtgtactt 
tcagcacctt 
aggtgtacac 
gccgcaacca 
ccaacgccgc 
gcgcggtcgg 
tgtcctcctt 
gcctggcggc 
aggccctgta 
gggagggcga 
tacggtacat 
gcacgagcgc 
ccaactacac 
gggtttgttg 
gtttggtacg 
gtgtccgcgg 
ccgattcccc 
ctacccgcgc 
ggtttattta 
gcgggccccc 
gtgtgctgcg 
ggatcc 



gtccgtggag 
caaccacata 
gctgcagaat 
cgcctcggcc 
ctccacgtgc 
ctcgcggccc 
cccgttggtc 
cgagccgtgc 
cgaggagtac 
catcgacctc 
ccgccacgag 
gctgcacgac 
catgttcgcg 
caaggtggtg 
catgtccaac 
ggctttcttc 
cccgctaacc 
ggagggcggc 
ggccctggtg 
gctgctcagc 
ccaagttccc 
taaataaaaa 
ccttttgtgt 
cctgtttcct 
ccccacgcct 
ctcccaataa 
aaaagatata 
atcatctgag 
ctgctggcgt 



cgcatcaaga 
cagcgccatg 
cacgagctga 
accgtgggcc 
gtgccggtcg 
ggggcctgct 
gaggggcagg 
accgtgggac 
gcgtactccc 
aacatcacca 
atcaaggaca 
ctgcgcttcg 
ggcctgggcg 
atgggcatcg 
ccctttgggg 
gcctttcgct 
accaaggagc 
gactttgacg 
tctgccatgg 
gccaaggtca 
aacaaagacg 
ccacgggtgt 
gtgtgggaag 
ctttcctttc 
tcctccacgt 
aaaaagaaca 
cagtaagaca 
agacgaacaa 
tgacaagggc 
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